-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
#8174: [WIP] Replace ttlib custom Falcon matmuls with ttnn matmuls #8273
base: main
Are you sure you want to change the base?
Conversation
I am still debugging PCC drop from the switches and also need to make sure perf isn't impacted, but I want to get some early feedback from Falcon7B model owners. |
Thanks for doing the cleanup! I guess the main diff is that ttnn and legacy falcon cpp functions generate program config in a different way? Prefill-side, I think this should be ok since we already set explicit program config for perf-impactful matmuls. One note though, there are two paths for prefill currently - optimized and the currently used one. I am preparing a PR for today that will switch to using this optimized version for relevant seq lens (128, 1k and 2k), so it might be good to see the pcc/perf once the switch is made. |
Sounds good. I will rebase and test after your changes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look good, thanks for doing this Brian. FYI @tt-asaigal, matmuls have been changed to ttnn, @TT-BrianLiu will run the multi-chip tests to verify async performance
Passing pipelines:
|
b2cb466
to
4c00724
Compare
4c00724
to
02fd963
Compare
02fd963
to
2ecffa1
Compare
No description provided.